Which Space Partitioning Tree to Use for Search - Summary
نویسنده
چکیده
Trees like binary-space-partitioning trees, kd-trees, principal axis trees and random projection trees are used to answer the question ”which tree to use for nearest-neighbor search?.” This paper deals with the influence of the vector quantization performance of the trees on the search performance and the margins of the partitions in these trees. Theoretical results show that both factors have an impact on search performance. 1 Nearest-neighbor search The most common techniques for nearest-neighbor search are the search with hierarchical tree indices and search with hash-based indices. Multidimensional binary space-partitioning trees like kd -trees[1] are used very often for nearest neighbor search. The time it takes to find the nearest neighbour in low dimensions is O(log(n)), but with increasing dimensionality the worst case for the search can be every node in every dimension which basically means O(n). Real data has low ”intrinsic” dimensionality. This is why binary spacepartitioning trees in high dimensions are said to perform worse even though there are no scientific results to support this assumption. This paper presents theoretical results which link the search performance of BSP-trees to properties of the data and the tree. In this way individual factors that affect the search performance of the BSP-trees can be dealt with.
منابع مشابه
OPTIMIZATION OF SKELETAL STRUCTURES USING IMPROVED GENETIC ALGORITHM BASED ON PROPOSED SAMPLING SEARCH SPACE IDEA
In this article, by Partitioning of designing space, optimization speed is tried to be increased by GA. To this end, designing space search is done in two steps which are global search and local search. To achieve this goal, according to meshing in FEM, firstly, the list of sections is divided to specific subsets. Then, intermediate member of each subset, as representative of subset, is defined...
متن کاملBall*-tree: Efficient spatial indexing for constrained nearest-neighbor search in metric spaces
Emerging location-based systems and data analysis frameworks requires efficient management of spatial data for approximate and exact search. Exact similarity search can be done using space partitioning data structures, such as KD-tree, R*-tree, and ball-tree. In this paper, we focus on ball-tree, an efficient search tree that is specific for spatial queries which use euclidean distance. Each no...
متن کاملDIAGNOSIS OF BREAST LESIONS USING THE LOCAL CHAN-VESE MODEL, HIERARCHICAL FUZZY PARTITIONING AND FUZZY DECISION TREE INDUCTION
Breast cancer is one of the leading causes of death among women. Mammography remains today the best technology to detect breast cancer, early and efficiently, to distinguish between benign and malignant diseases. Several techniques in image processing and analysis have been developed to address this problem. In this paper, we propose a new solution to the problem of computer aided detection and...
متن کاملWhich Space Partitioning Tree to Use for Search?
We consider the task of nearest-neighbor search with the class of binary-spacepartitioning trees, which includes kd-trees, principal axis trees and random projection trees, and try to rigorously answer the question “which tree to use for nearestneighbor search?” To this end, we present the theoretical results which imply that trees with better vector quantization performance have better search ...
متن کاملCopy-move Detection Algorithm Efficiency Increase Using Binary Space Partitioning Trees
Duplicates embedding is one of the most frequently used type of digital image change. The copy-move procedure includes copying an image fragment from one part and pasting it to another part of the same image. Moreover, this fragment can be changed using some transformations, like affine or contrast enhancement. Existing copy-move detection methods consist of two main steps: feature calculation ...
متن کامل